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Abstract A novel coronavirus strain was isolated from laryngotracheal swab of wild partridge and designated as partridg(0 GEO 
SI 40 20030 SI40- Its whole genomic sequence was obtained!) GenBank Accession numberO AY6462830 through RT-PCR amplificationO 
cloningO nucleotide sequencingO and analysis by the DNASTAR program. To investigate the origin of the virusO we further analyzed the 
nucleotide sequences of the main structural proteinsO and compared those with other available virus isolates. Our results showed that the 
highest nucleotide homologies between the SI gene of S14 strain and those of nephrogenic-type strains JX1-99 and TJ2-96 were 94. 6% 
and 93.4%D respectively. In additionD a relatively high genetic identityD 85% and 84. 3%D respectivelyD was detected between SI gene 
of SI 4 and those of strains QXIBV and LX4. The results suggested that the SI4 strain may be originated from or related to nephrogenic- 
type and proventriculus-type infectious bronchitis virud IBVD- The highest nucleotide homology between the S2 gene of SI 4 strain and 
those of QXIBV and LX4 was 85% and 84. 3% D respectively and all of them belonged to group II corona viruses. The highest nucleotide 
homology between the M gene of S14 strain and those of strains SAIB20 and GD6-98 was 90.6% and 90.2% 0 respectively by which S14 
belonged to group III. Although they displayed high level of genetic identity in SI and S2 geneQ there was lower homology of M coding se¬ 
quences between SI4 and BJG and between SI4 and QXIBV strains. Phylogenetic analysis of N gene indicated that group I strains might 
evolve from RNA recombination between strain H52 and GrayD while group II strains from strain HI20 and D1466. SI 4 strain had the 
highest N gene homology with strain QXIBV which was 95. 7%0 thus classified as a group III member. Strains SAIB20 and GD6-98 
which were closely related to the M gene of SI4 strain belonged to group I and IVQ respectively. A possible role of partridge SI4 strain 
may play in the process of coronavirus evolution is discussed. 

Keywords!!] wild birdD corona virusO structural protein genesD virus evolution. 


Coronavirus exists widely in various kinds of do- 
>mestic and wild animals. They often result in respira¬ 
tory or intestinal illness including vomitG diarrhea and 


tn- 
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C/3. 
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•geven death of the infected animals 1 . In this studyG 
Pone wildbird coronavirus was isolated and character¬ 
ized from a total of 1510 laryngotracheal swabs and 
J>83 tissue samplesD which were collected from wild 


Reference strainsD Respiratory-type strains 

eluded M41 □ SI M21883D S 2 Q X04722D 
AF286184Q ND M28566DD Ark99D SD L10384DND 
M85244 □□ SD1-97D AY325732 H] DE072 □ MD 
AF202999QND AF203001DD D146d SDM21971DMD 
AF203004D ND AF203006 □□ 

AY251817Q M0 AY325734 


HaNl-95 □ SID 
HaN2-95 


I 




poultry distributed in various areas of Guangdong 
^ProvinceG China. This strain was closely related to a- 
Qvian infectious bronchitis virus but distantly to SARS- 
CoV by phylogenetic comparison with viruses using 
genomic sequence data available in GenBank IM . 

1 Materials and methods 

1.1 Materials 

VirusesG Partridge coronavirus G designated as 
S14Q was isolated and identified by the Key Laborato¬ 
ry of Poultry Feeding and Diseases ControlO Ministry 
of AgricultureO China. 


□ AY325728 00 GX1-98 □ SID AY319302D MD 
AF325727D ND AY278110 □□ GX2-98 □ SID 
AY251816D MD AY325733D ND AY251816DD H 120 
D SD M21970D MD AY028295D ND AY028296DD H52 
D SD AF352315D MD AF286185 D- CU-T 2 D SD 
U04739D MD U46035D ND U04803D was a respirato¬ 
ry mutant typeO nephrogenic-type strains included 

JX1-990 AF210735DD SAIBwjD AF397528DD Holte 
D SD L18988D AF334685DD Gra>{] SID L14069D S2D 
AF394180 DD KB8523 D M21515 DD X D SID 
AF427819D ND AY043315DDBJD SD AY3196551D- 

Proventriculus-type strains included Q1 

D AF286302DD J2D AF352312DD T3D AF227438DD 


* Supported by National Natural Science Foundation of ChimQ Grant No. 39893290D 
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QXIBV □ AF193423 H] LX4 □ SO AY1851970 NO 
AY338732QHZJ971Q SO AF3523110 NO AF352308Q. 

Specific-pathogen-free D SPF D embryonated 
chicken eggsQ Nine-day-old SPF embryonated chicken 
eggs were kindly provided by Experimental Station of 
Veterinary MedicineD South China Agricultural Uni¬ 
versity. 


Allantoic fluids were collected asepticallyD centrifuged 
at 4000 rpnflmin at 4°CD and then virus genomic 
RNA was extracted from the supernatant according to 
the protocols of the Total RNA Extraction Kit III 
D Sino-American Biotechology Ltd. G- 

1.2.2 RT-PCR amplificationG cloning and sequenc¬ 
ing of virus genome segments RT-PCR amplifica- 


Total RNA Extraction Kit III was purchased 
from Sino-American Biotechnology Co. AMV XLG 
DNasinQ 3'-Full RACE Core SetQ 5'-Full RACE Core 
^SetG P0 NQ6Q and Ex Taq enzyme were provided by 
TTaKaRa Biotechnolog>0 DalianG Co. G Ltd. 

£l.2 Methods 

<D 

gjl. 2.1 Virus propagation and RNA 

^extraction SI4 strain virus was inoculated to 10- 
Uday-old SPF embryonated chicken eggsG followed by 
^incubation at 37°C for 48 hours. Chicken embryos 
Sthat died in 24 hours post-inoculation were discarded. 

H 

PQ 


tion and cloning of virus genome segments were car¬ 
ried out according to the methods of Ren et alP 
The extracted viral RNA was desolved in 11. 5 /xL 
DEPC-treated water and then reversely transcribed. 
The 3 - and 5 -Full RACE were performed according 
to the recommendations of the manufacturer 
G TakaraG- Positive clones were sent to Shanghai 
BioAsia Biological Technology Co. G Ltd. for nu¬ 
cleotide sequencing. 

A total of 29 pairs of primerfl Table 1G were de¬ 
signed for cloning and sequencing of genomic seg¬ 
ments and synthesized by Shanghai BioAsia Biological 
Technology CO. G Ltd. 


Table 1. Primer pairs designed 


cd 

jh Forward 

cd_ 

^ 1 5'-CAA GCC TAA AAC AGG GAG TAT C-3' 

g 2 5'-GGC ATG CGA AAT GAC TTA AC-3 

eg 3 5'-GAG AAA ATG TGT GAC GAC CT -3' 

^ 4 5'-TGG ATG AGT TTA AAG AGT TCT G-3' 

^ 5 5'-ATG GTG TTA AAT ACC GCT C-3' 

° 6 5' GCA GAA CAC CTT AAT AAA CCT-3' 

•g 7 5'-GCC ATC TAT TAT GCG CAA -3' 

£ 8 5'-TCC TAC TTG GTT TAA TAG AGA AA-3' 

'M 9 5'-GGC AAT GAG TTT GTT GGT AAT -3' 

^ 10 5'-TGC ATT ACACACTGG AACTGA-3' 

-o 11 5'-TTT GAC TAA gCT TAA TGT TGA A -3' 

o 12 5'-CAT ATT CAA CAG TTG TTT GGA-3' 

g 13 5'-AAT ACC TAC TAC GGA RAA AGA CC-3' 

g 14 5'-CTT TAA GTA CTG GGA TCA AGA GTA-3' 

o 15 5'-GAT AAG AGT GCW GGC TAY CC-3' 

Q 16 5'-TGT TCW CAA CAY ACA ATG CTA GT-3' 

17 5'-GTA TCT AAT GGT ACY GTT TTT GG-3' 

18 5'-GTC MTT TAT TAA TGG TAA GAT AAA-3' 

19 5'-GTT CTT GTG GTT MTA GAG CTA CAA C-3' 

20 5-AAY ATT CGC ACA CTG CCA AA-3' 

21 5'-TTG AAA ACT GAA CAA AAG ACA-3' 

22 5 -CAG TTT GTA GTT TCT GGT GG-3' 

23 5'-TTG TCW AAA TGG TGG AAT GA-3' 

24 5'-CCT AAG AAC GGT TGG AAT-3' 

25 5' -TGA CCA ARG CGG AAA TAA GA-3' 

26 5' -GAT GGT ATA GTG TGG GTT-3' 

27 5'-GAT GAC AAG ATG AAT GAG GA-3' 


Reverse bp 

5'-GTT AAG TCA TTT CGC ATG CC -3' 940^ 

5' - AGG TCG TCA CAC ATT TTC TC-3' 941 

5-CAG AAC TCT TTA AAC TCA TCC A-3' 1421 

5' -GAG CGG TAT TTA ACA CCA T-3' 988 

5' - AGG TTT ATT AAG GTG TTC TG-3' 800 

5'-TTG CGC ATA ATA GAT GGC-3' 677 

5' -TTT CTC TAT TAA ACC AAG TAG GA-3' 1034 

5' - ATT ACC AAC AAA CTC ATT GCC-3' 962 

5' -TCA GTT CCA GTG TGT AAT GCA -3' 926 

5'-TTC AAC ATT AAG CTT AGT CAA A-3' 1150 

5' -TCC AAA CAA CTG TTG AAT ATG -3' 1730 

5-GGT CTT TYT CCG TAG TAG GTA TT-3' 1628 

5' -TAC TCT TGA TCC CAG TAC TTA AAG-3' 741 

5 -GGR TAG CCW GCA CTC TTA TC-3' 1324 

5' - ACT AGC ATT GTR TGT TGW GAA CA-3' 453 

5-CCA AAA ACR GTA CCA TTA GAT AC-3' 696 

5' -TTT ATC TTA CCA TTA ATA AAK GAC-3' 1298 

5'-GTT GTA GCT CTA KAA CCA CAA GA-3' 481 

5'-TTT GGC AGT GTG CGA ATR TT-3' 1326 

5'-TGT CTT TTG TTC AGT TTT CAA-3' 871 

5'-CAT AAC TAA CAT AAG GGC AAT-3' 1372 

5' -TCA TTC CAC CAT TTW GAC AA-3' 840 

5' - ATT CCA ACC GTT CTT AGG-3' 657 

5' -TAC TCT CTA CAC ACA CAC -3' 632 

5' - AAT GAA RTC CCA ACG GAA AT-3' 1290 

5'-CTC CTC ATT CAT CTT GTC -3' 1644 

5-AAG CTA CAT GCC TAT CTT CC-3' 1010 


28 5'RACE1-F5'-CAT TTA GAA GAC ATC TTT GGT 5' RACE1-R5 -TGT GAC GTA TAG AAA AAC 604 

GTG3' AAA GCG TCA C-3' 

29 3'RACE-F5 -GGA AGA TAG GCA TGT AGC TT - 3'RACE-R5'-CTG ATC TAG AGG TAC CGG ATC 332 

3'_UT_ 
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Analysis of nucleotide sequence was carried out 
by the BLAST program and compared with published 
relevant sequences available in GenBank IM . A num¬ 
ber of representative viruses from different geographi¬ 
cal locations were selected and analyzed by DNAS- 
TAR software package. 

2 Results 


position 221—233D T at 236D A at 262Q ATGGCC at 
415—421 were identified. There were similar muta¬ 
tions in LX4 and QXIBV strainsG inserting TC and 
AG at position 360D respectively. There were also 
some nucleotide substitutions in SI4 strain. Com¬ 
pared with position 25 of M41 strainG the deduced 
amino acid sequence of SI4 had an AsrQ ND insertion 
at this point and G1)Q GG + Sef] SD at position 119. 


2.1 RT-PCR amplification of SI 4 genome 

SI4 strain was used for RNA purification and 
i2RT-PCR amplification as described above. As shown 
<^in Fig. 1G we successfully obtained the cDNA frag- 
Aments by genomic terminal RNA RACED lane ID 
£29GD or routine RT-PCR. 



Fig. 1. RT-PCR amplification of SI4 whole genomic RNA. MD 
T Marker-DL2000D lane IDS RACED lanes 2—28D cloning segments 
using the primer pairs as shown in Table ID lane 29D 3' RACE. 


'g2.2 Analysis of nucleotide sequences of SI protein 
>genes and deduced amino acid sequence 

‘3 

T SI hypervariable regions of the two strainfl M41 
^and S14G were obtained by sequencingG which includ¬ 
ed the nucleotides from start coden ATG to the cleav¬ 
es 

^age site of S precursor protein. SI gene of SI4 strain 
|was 1620 bp longG encoding 540 amino acids and con¬ 
taining 19 potential N-glycosylation sites. The G + C 
percentage was 35.49%. Hydrophilic analysis of de¬ 
duced amino acid sequences of SI protein showed that 
the first 20 N-terminal amino acid residues of SI pro¬ 
tein of SI 4 virus formed a hydrophobic regionG which 
was presumed as the signal sequence of S precursor 
protein. Amino acid sequences at the cleavage site of 
S precursor protein were RRFRR. 

Compared with reference strain M41G there were 
several insertions and deletions in SI 4 strainG a TCT 
insertion at position 67G ATTA at 216G ATAATCCA 
at 245D A at 353D G at 266D A at 350D AG at 360D 
TCT at 362D ATGG at 420 and CT at 429. On the 
other handG the deletions of ATTCATGGTGGT at 


Fig. 2 shows four groups of infectious bronchitis 
virusedG IBVG. Group I could be divided into two 
subgroupsG in which GrayG Vic SG JMK strains be¬ 
long to subgroup A and others belong to subgroup B. 
Group II could also be divided into two subgroupsG 
with S14G JX1-99G and TJ2-96 strains classified to 
subgroup C and other strains to subgroup D. Sub¬ 
group B strains had high genetic identity with strains 
H52G H120 and M41. It was speculated that Group I 
viruses were mutated from vaccine virus. All Group 
II viruses were isolates from China including pub¬ 
lished nephrogenic-type and proventriculus-type 
strains. S14 strain was also found in this group. Gx2- 
98 isolate from China of Group III had close identity 
with strain HolteG and it was deduced that it might 
be mutated from Holte. Group IV strains were 
proventriculus-type infectious bronchitis viruses iso¬ 
lated in ChinaG the SI gene of which was distantly re¬ 
lated to those of other viruses. 

2.3 Analysis of nucleotide sequences of S2 gene and 
deduced amino acid sequence 

The coding region of S2 protein of the two 
strainf] M41 and S14G was obtained by nucleotide se¬ 
quencing analysisG covering the nucleotides from the 
cleavage site of S precursor protein to the stop codon. 
S2 protein coding region of SI 4 strain was 1878 bp in 
lengthG which encodes 625 amino acids. The percent¬ 
age of G+ C was 35.52%. S14 strain had 13 N-gly- 
cosylation sites through analysis of glycosylation 
recognition signals. There was a hydrophobic region 
at position 550 to 590 in the C-terminalG which might 
be the connecting region of envelope and nucleocap- 
sid. Compared with the published sequence of M41 
strairG GenBank accession numberG M21883 GG there 
were many nucleotide mutations in S2 protein gene of 
SI4 strain. 

Coronavirus strains of Group I were closely relat¬ 
ed to H52G M41 and Holte. All Group II virusesG in¬ 
cluding published nephrogenic-type and proventricu¬ 
lus-type strainsG were isolated from China. S14 strain 
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was also in this group. Taken togetherD there was 
close genetic identity between Group I strains and 
Group II strainsQ while Group III viruses were repre¬ 


sentatives for American respiratory-type and vaccine 
straimQ Fig. 3D- 
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Fig. 2. Phylogenetic tree of SI gene. * IBV isolates from China. 



Fig. 3. Phylogenetic tree of S2 protein gene. * IBV isolates in China. 


2. 4 Analysis of nucleotide sequences of M protein 
gene and deduced amino acid sequence 

Nucleotide sequences and deduced amino acid se¬ 
quence of M protein of these two strains were further 
analyzed. The open reading frame of the M protein of 
SI4 strain was 678 bp in lengthD encoding 225 amino 
acids. There were two N-glycosylation sites at the N- 
terminal of SI4 strain. The first 11 amino acids of 
SI 4 strain were hydrophilic and were presumed as the 
outmembrane domain of M proteinD their downstream 
20 to 100 amino acids formed three hydrophobic re¬ 


gions and were considered as three transmembrane 
domains. 

In comparison with the published sequence of 
H52-GE0 GenBank accession numberG AY044184GD 
there were many nucleotide substitutions but no in¬ 
sertion or deletion in M protein gene of SI4 strain. 

Based on M gene dendrogramG Group I could be 
further divided into subgroups A and B. While JL1- 
97D GX1-98D DE072D HaN2-95D and HEN4-94 were 
closely related to H52G M41 belonged to subgroup A. 

D1466D H120D SAIBwjD SD1-97D SC2-93 and 
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Cal99D which had the highest genetic dentity with 
H120Q belonged to subgroup B. S14 belonged to 


group 11 ID and was also closely related to Groups I 
and II but distantly to Group IVD Fig.4D« 
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Fig. 4. Phylogenetic tree of M protein gene. * IBV isolates in China 


£2 . 5 Analysis of nucleotide sequences of N protein 

^gene and deduced amino acid sequence 


in 

a 


Nucleotide sequence and deduced amino acid se¬ 
quence of N protein gene of two strainsDM14 and 
OS14D were obtained by sequencing. The open reading 

^-l-H 

Sframe of N protein of S14 strain was 1230 bp in 
'SlengthD encoding 409 amino acids. There was one N- 
>glycosylation site at the N-terminal of S14 strainD 
^5 with another in the middle. N protein was hy- 
>drophilicDwhich may be beneficial for the interaction 


c2 

•^ i 

cS 


T3of N protein and RNA. 

T3 
O 


In comparison with the published sequence of 


^M4lD GenBank accession numberD M28566DD the N 


^protein gene of SI4 strain had nucleotide substitutions 
but no insertion or deletion. 

Based on the analysis of N protein geneD Group I 
could be further divided into subgroups A and BD of 
which JL1-97D SAIB14 and SAIBwj closely related to 
H52 belonged to subgroup A and SAIB20D GX1-98D 
GX2-98 which had the highest identity with Gray and 
Ark99 belonged to subgroup B. Group II could also 
be divided into C and D subgroupsD of which SC2-93D 
SD1-97 closely related to D1466 belonged to subgroup 
C and HaN2-95D GDI-87 which had the highest iden¬ 
tity with HI20 belonged to subgroup D. SI 4 strain 
had the highest identity with QXIBV and belonged to 
Group IIC Fig. 5Q. 


3 Discussion 

As an RNA virusD IBV is prone to genetic muta¬ 
tion and recombination during the process of replica- 
tionO resulting in the emergence of new mutant 
strains with different serotypes. ThereforeD it is diffi¬ 
cult to develop effective vaccines against IBV infec¬ 
tion. As a resultD avian infectious bronchitis still pos¬ 
es significant economic threat to the poultry industry. 
In our previous workD we obtained four coronavirus 
isolates from wild partridgeD one of which is designat¬ 
ed as Partridge coronavirus SI 4. 

Phylogenitic analysis of the whole genome of SI 4 
helped us find out the origin of the virus. By analyz¬ 
ing the structural protein gened Sid S2D M and ND 
of S14D SI 4 was clustered with Group If viruses by 
SI geneD the viruses in this group are all nephrogenic- 
type and proventriculus-type IBVs isolated from Chi¬ 
na. SI 4 had a high identity with nephrogenic-type 
strains JX1-99D TJ2-96D QXIBVD LX4 and BJ. 
While S2 gene of SI4 strain had a much higher genet¬ 
ic identity with the Chinese proventriculus-type 
strains QXIBV and LX4D which was 97. 2% and 
90.6%D respectively. They formed a subgroup II 
with another Chinese IBV isolate BJ. Phylogenetic 
analysis of M gene showed that SI 4 belonged to 
Group III and had a higher genetic identity with the 
Chinese isolates SAIB20D GD6-98 and LX4 than that 
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of BJ and QXIBV. Analysis of N gene grouped S14 to QXIBVD LX4 and BJ. 
Group III which included the Chinese IBV strains 



U It was considered that S2 and N genes of IBV 

( | ( 

°were conserved. In our researchG the S2 and N genes 
•^of SI 4 were closely related to the Chinese IBV isolates 
>such as QXIBVG LX4 and BJG so was the SI gene. 
^HoweverG it was not the case for M gene. Analysis of 
gene showed that the M gene of SI 4 was closely 
-grelated to IowaG SAIB20G GD6-98 and LX4 but very 
^distantly to BJ and QXIBV. Since SlG S2 and N 
'agenes of SI4 were closely related to those of Chinese 
oisolates QXIBV and BJG whose sequences were all 
distantly related to those of strains from other coun- 
triesG QXIBV and BJ might be the virulent isolates in 
China where SI4 was originated. As far as M gene 
was concernedG SI 4 was not closely related to vaccine 
strains H52 and H120 which were widely used in 
China. ThereforeG it was predicted that the evolution 
of SI 4 was also related to the mutation of vaccine 
strains. Some new IBV strains in China and other 
countries might be the recombinants of vaccine strains 
and field strainS 6 ^. Further study should be carried 
out to determine whether SI 4 was the recombinant of 


IBV isolates in China and introduced vaccine strains. 
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